首页> 外文OA文献 >A Stochastic Finite-State Word-Segmentation Algorithm for Chinese
【2h】

A Stochastic Finite-State Word-Segmentation Algorithm for Chinese

机译:汉语随机有限状态分词算法

摘要

We present a stochastic finite-state model for segmenting Chinese text intodictionary entries and productively derived words, and providing pronunciationsfor these words; the method incorporates a class-based model in its treatmentof personal names. We also evaluate the system's performance, taking intoaccount the fact that people often do not agree on a single segmentation.
机译:我们提出了一种随机有限状态模型,用于将中文文本分割成字典条目和生产性衍生词,并为这些词提供发音。该方法在处理个人姓名时采用了基于类别的模型。我们还考虑了人们通常在单个细分市场上不一致的事实,从而评估了系统的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号